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Abstract The lepton identification is essential for the physics 
programs at high-energy frontier, especially for the precise 
measurement of the Higgs boson. For this purpose, a Toolkit 
for Multivariate Data Analysis (TMVA) based lepton iden- 
tification (LICH') has been developed for detectors using 
high granularity calorimeters. Using the conceptual detector 
geometry for the Circular Electron-Positron Collider (CEPC) 
and single charged particle samples with energy larger than 
2 GeV, LICH identifies electrons/muons with efficiencies 
higher than 99.5% and controls the mis-identification rate of 
hadron to muons/electrons to better than 1%/0.5%. Reduc- 
ing the calorimeter granularity by 1-2 orders of magnitude, 
the lepton identification performance is stable for particles 
with E> 2 GeV. Applied to fully simulated eeH/u WH events, 
the lepton identification performance is consistent with the 
single particle case: the efficiency of identifying all the high 
energy leptons in an event, is 95.5-98.5%. 


1 Introduction 


After the Higgs discovery, the precise determination of the 
Higgs boson properties becomes the focus of particle physics 
experiments. Phenomenological studies show that the physics 
at TeV scale would be revealed if the Higgs couplings could 
reach the percent level measurement accuracy[1][2]. 

The LHC is a powerful Higgs factory. However, the pre- 
cision of Higgs measurements at the LHC is limited by the 
huge QCD background, the large theoretical and systemati- 
cal uncertainties. In addition, the Higgs signal at the LHC is 
usually tagged by the Higgs decay products, making those 
measurements always model dependent. Therefore, the pre- 
cision of Higgs couplings at the HL-LHC is typically limited 
to 5-10% level depending on theoretical assumptions [3][4]. 
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In terms of Higgs measurements, the electron-positron 
colliders play a role complementary to the hadron collid- 
ers with distinguishable advantages. Many electron-positron 
Higgs factories have been proposed, including the Interna- 
tional Linear Collider (ILC), the Compact LInear Collider 
(CLIC), the Future e+e- Circular Collider (FCC-ee) and the 
CEPC [1][5][6]. These proposed electron-positron Higgs fac- 
tories pick and reconstruct Higgs events with an efficiency 
close to 100%, and determine the absolute value of the Higgs 
couplings. Compared to the LHC, these facilities have much 
better accuracy on the Higgs total width measurements and 
Higgs exotic decay searches, in addition the accuracies of 
Higgs measurements are dominated by statistic errors. For 
example, the circular electron-positron collider (CEPC) is 
expected to deliver | million Higgs bosons in its Higgs op- 
eration, with which the Higgs couplings will be measured to 
percent or even per mille level accuracy[6]. 

The lepton identification is essential to the precise Higgs 
measurements. The Standard Model Higgs boson has roughly 
10% chance to decay into final states with leptons, for exam- 
ple, H> WW* —llvv/lvqq, H-ZZ*—llqq, H- tt, H> 
uu, etc. The SM Higgs also has a branching ratio Br(H—>bb) 
= 58%, while the lepton identification provides an impor- 
tant input for the jet flavor tagging and the jet charge mea- 
surement. On top of that, the Higgs boson has a significant 
chance to be generated together with leptons. For example, 
in the ZH events, the leading Higgs generation process at 
240-250 GeV electron-positron collisions, about 7% of the 
Higgs bosons are generated together with a pair of leptons 
( Br(Z—ee) and Br(Z— uu) = 3.36% ). At the electron- 
positron collider, ZH events with Z decaying into a pair of 
leptons is regarded as the golden channel for the HZZ cou- 
pling and Higgs mass measurement[7]. Furthermore, lep- 
tons are intensively used as a trigger signal for the proton 
colliders to pick up the physics events from the huge QCD 
backgrounds. The Particle Flow Algorithm (PFA) becomes 


the paradigm of detector design for the high energy frontier[8, 
9,6, 12]. The key idea is to reconstruct every final state par- 
ticle in the most suited sub-detectors, and reconstruct all 
the physics objects on top of the final state particles. The 
PFA oriented detectors have high efficiency in reconstruct- 
ing physics objects such as leptons, jets, and missing energy. 
The PFA also significantly improves the jet energy resolu- 
tion, since the charged particles, which contribute the ma- 
jority of jet energy, are usually measured with much better 
accuracies in the trackers than in the calorimeters [14,9, 10, 
11,131]: 

To reconstruct every final state particle, the PFA requires 
excellent separation by employing highly-granular calorime- 
ters. In the detector designs of the International Large De- 
tector (ILD) or the Silicon Detector (SiD) [1,15], the total 
number of readout channels in calorimeters reaches the 108 
level. In addition to cluster separation, detailed spatial, en- 
ergy and even time information on the shower developments 
is provided. An accurate interpretation of this recorded in- 
formation will enhance the physics performance of the full 
detector [16]. 

Using the information recorded in the high granularity 
calorimeter and the dE/dx information recorded in the tracker, 
LICH(Lepton Identification in Calorimeter with High gran- 
ularity), a dedicated lepton identification algorithm for Higgs 
factories has been developed. Using CEPC conceptual de- 
tector geometry [6](based on ILD) and the Arbor[14] recon- 
struction package, its performance is tested on single parti- 
cles and physics events. For the single particles with energy 
higher than 2 GeV, LICH reaches an efficiency better than 
99.5% in identifying the muons and the electrons, and 98% 
for pions. Its performance on physics events (eeH/ uH) and 
the final efficiency agrees with the efficiency at the single 
particle level. 

This paper is organized as follows. The detector geom- 
etry and the samples are presented in section 2. In section 
3, the discriminant variables measured from charged recon- 
structed particles are summarized and the algorithm archi- 
tecture is presented. In section 4, the LICH performance on 
single particle events is presented. In section 5, the corre- 
lations between LICH performance and the calorimeter ge- 
ometry are explored. In section 6, the LICH performance on 
ZH events where Z decays into ee or uu pairs is studied, the 
results are then compared with that of single particle events. 
In section 7, the results are summarized and the impact of 
calorimeter granularity is discussed. 


2 Detector geometry and sample 


In this paper, the reference geometry is the CEPC conceptual 
detector [6], which is developed from the ILD geometry [1]. 
ILD is a PFA oriented detector meant to be used for centre of 
mass energies up to 1 TeV. It is equipped with a low material 


tracking system and a calorimeter systems with extremely 
high granularity. 

In this CEPC conceptual detector design, the forward re- 
gion, and the yoke thickness have been adjusted to the CEPC 
collision environment with respect to the ILD detector. The 
core part of this detector is a large solenoid of 3.5 Tesla. 
The solenoid system has an inner radius of 3.4 meters and a 
length of 8.05 meters, inside which both tracker and calori- 
meter system are installed. The tracking system is composed 
of a TPC as the main tracker, a vertex system, and the sili- 
con tracking devices. The amount of material in front of the 
calorimeter is kept to ~ 5% radiation length. Both ECAL 
and HCAL use sampling structures and have extremely high 
granularity. The ECAL uses tungsten as the absorber and 
silicon for the sensor. In depth, the ECAL is divided into 
30 layers and in the transverse direction, each layer is di- 
vided into 5 by 5 mm? cells. The HCAL uses stainless steel 
absorber and GRPC(Glass Resistive Plate Chamber) sensor 
layers. It uses 10 by 10 mm? cells and has 48 layers in total. 

As a Higgs factory, the CEPC will be operated at 240- 
250 GeV center of mass energy. To study the adequate lep- 
ton identification performance, we simulated single particle 
samples (pion+, muon-, and electron-) over an energy range 
of 1-120 GeV (1, 2, 3, 5, 7, 10, 20, 30, 40, 50, 70, 120 GeV). 
At each energy point, 100k events are simulated for each par- 
ticle type. These samples follow a flat distribution in theta 
and phi over the 47 solid angle. 

These samples are reconstructed with Arbor (version 3.3). 
To disentangle the lepton identification performance from 
the effect of PFA reconstruction and geometry defects, we 
select those events where only one charged particle is re- 
constructed. The total number of these events is recorded 
as Nj particle, and the number of these events identified with 
correct particle types is recorded as Nj particle. r. The perfor- 
mance of lepton identification is then expressed as a migra- 
tion matrix in Table 2, its diagonal elements ei refer to the 
identification efficiencies (defined as Nj particie.t /N1Particle), 
and the off diagonal element Pi represent the probability of 
a type i particle to be mis-identified as type j. 


Table 1 Migration Matrix 
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3 Discriminant variables and the output likelihoods 


LICH takes individual reconstructed charged particles as in- 
put, extracts 24 discriminant variables for the lepton iden- 


tification, and calculates the corresponding likelihood to be 
an electron or a muon. These discriminant variables can be 
characterized into five different classes: 


dE/dx 

For a track in the TPC, the distribution of energy loss per 
unit distance follows a Landau distribution. The dE/dx 
estimator used here is the average of this value but after 
cutting tails at the two edges of the Landau distribution 
(first 7% and last 30%). The dE/dx has a strong discrim- 
inant power to distinguish electron tracks from others at 
low energy (under 10 GeV) (Figure 1). 
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Fig. 1 dE/dx for e~, u~ and a+, for electrons it is stable around 2.4 x 


107 


7, for muon and pion it is smaller at energy lower than 10 GeV and 


after that they start mixing with electron 


Fractal Dimension 

The fractal dimension (FD) of a shower is used to de- 
scribe the self-similar behavior of shower spatial con- 
figurations, following the original definition in [16], the 
fractal dimension is directly linked to the compactness 
of the particle shower. 

At a fixed energy, the EM showers are much more com- 
pact than the muon or hadron shower, leading to a large 
FD. The muon shower usually takes the configuration of 
a 1-dimensional MIP(Minimum Ionizing Particle) track, 
therefore has a FD close to zero. The FD of the hadronic 
shower usually lays between the EM and MIP tracks, 
since it contains both EM and MIP components. A typ- 
ical distribution of FD for 40 GeV showers is presented 
in Figure 2, 

For any calorimeter cluster, LICH calculates 5 different 
FD values: from its ECAL hits, HCAL hits, hits in 10 or 
20 first layers of ECAL, and all the calorimeter hits. 


Energy Distribution 

LICH builds variables out of the shower energy infor- 
mation, including the proportion of energy deposited in 
the first 10 layers in ECAL to the entire ECAL, or the 
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Fig. 2 Fractal dimension using both ECAL and HCAL for e~, u~ and 
m” at 40 GeV 


energy deposited in a cylinder around the incident direc- 
tion with a radius of 1 and 1.5 Moliere radius. 


— Hits Information 
Hits information refers to the number of hits in ECAL 
and HCAL and some other information obtained from 
hits, such as the number of ECAL (HCAL) layers hit by 
the shower, number of hits in the first 10 layers of ECAL. 


— Shower Shape, Spatial Information 

The spatial variables include the maximum distance be- 
tween a hit and the extrapolated track, the maximum dis- 
tance and average distance between shower hits and the 
axis of the shower (defined by the innermost point and 
the center of gravity of the shower), the depth (perpen- 
dicular to the detector layers) of the center of gravity, and 
the depth of the shower defined as the depth between the 
innermost hit and the outermost hit. 


The correlation of those variables at energy 40 GeV are 
summarized in Figure 3, the definitions of all the variables 
are listed in Appendix A. It is clear that the dE/dx, mea- 
sured from tracks, does not correlate with any other vari- 
ables which are measured from calorimeters. Some of the 
variables are highly correlated, such as FD_ECAL (FD cal- 
culated from ECAL hits) and EcalNHit (number of ECAL 
hits). However all these variables are kept because their cor- 
relations change with energy and polar angle. 


LICH uses TMVA[17] methods to summarize these in- 
put variables into two likelihoods, corresponding to elec- 
trons and muons. Multiple TMVA methods have been tested 
and the Boosted Decision Trees with Gradient boosting (BDTG) 
method is chosen for its better performance. The e-likeness 
(Le) and u-likeness (Lp) for different particles in a 40 GeV 
sample are shown in Figure 4. 
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Fig. 3 The correlation matrix of all the variables 
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Fig. 4 The e-likeliness and p-likeness of e~, u~ and z* at 40 GeV, 
grey lines are the cuts for different catalogs in next section 


4 Performance on single particle events 


The phase space spanned by the lepton-likelihoods (Le and 
Ly) can be separated into different domains, corresponding 
to different catalogs of particles. The domains for particles 
of different types can be adjusted according to physics re- 
quirements. In this paper, we demonstrate the lepton iden- 
tification performance on single particle samples using the 
following catalogs: 


— Muon: L, > 0.5 

— Electron: Le > 0.5 

— Pion: 1-(Ly+L,)> 0.5 

— Undefined: Ly <0.5 & Le < 0.5 & 1-(Lu+Le) < 0.5 


— barrel 1: middle of barrel (| cos 0| < 0.3), 

— barrel 2: edge of barrel (0.3 < | cos 0| < 0.7), 

— overlap: overlap region of barrel and endcap (0.7 < | cos 0| 
< 0.8), 

— endcap: (0.8 < |cos 0| < 0.98). 


Take the sample of 40 GeV charged particle as an ex- 
ample, the migration matrix is shown in Table 2. Comparing 
this table to the result of ALEPH for energetic taus[18], the 
efficiencies are improved, and the mis-identification rates 
from hadrons to leptons are significantly reduced. 


Table 2 Migration Matrix at 40 GeV (%) 


Type e` like u` like 1" like 
e 99.71 +0.08 < 0.07 0.21 +0.07 
U` <0.07 99.87 +0.08 0.05 +0.05 
mT 0.14 0.05 0.35+0.08 99.26 +0.12 


The lepton identification efficiencies (diagonal terms of 
the migration matrix) at different energies are presented in 
Figure 5 for the different regions. The identification efficien- 
cies saturate at 99.9% for particles with energy higher than 
2 GeV. For those with energy lower than 2 GeV, the perfor- 
mance drops significantly, especially in barrel2 and overlap 
regions. For the overlap region, the complex geometry lim- 
its the performance; while for the barrel2 region, charged 
particles with Pt < 0.97 GeV cannot reach the barrel, they 
will eventually hit the endcaps at large incident angle, hence 
their signal is more difficult to catalog. 

Concerning the off-diagonal terms of the migration ma- 
trix, the chances of electrons to be mis-identified as muons 
and pions are negligible (P£, PE < 107°), the crosstalk rate 
PH is observed at even lower level. However, the chances 
of pions to be mis-identified as leptons (P7, P7) are of the 
order of 1% and are energy dependent. In fact, these mis- 
identifications are mainly induced by the irreducible physics 
effects: pion decay and 2° generation via 2-nucleon colli- 
sion. Meanwhile, the muons also have a small chance to be 
mis-identified as pions at energy smaller than 2 GeV. Figure 
6 shows the significant crosstalk items (P7, Pjand PË) as 
a function of the particle energy in the endcap region. The 
green shaded band indicates the probability of pion decay 
before reaching the calorimeter, which is roughly compara- 
ble with Př. 


5 Lepton identification performance on single particle 


The probabilities of undefined particles are very low (<107°) events for different geometries 


at single particle samples with the above catalog. 

Since the distribution of these variables depends on the 
polar angle of the initial particle (0), the TMVA is trained 
independently on four subsets: 


The power consumption and electronic cost of the calorim- 
eter system scale with the number of readout channels. It’s 
important to evaluate the physics performance for different 
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Fig. 5 The efficiency of lepton identification for e~, y~ and a* as 


function of particle energy in the four regions 
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Fig. 6 The mis-identification rates of lepton identification for u and 
m in ~ 5000 events for the endcap region; Pion decay rate band (to 
account for the polar angle spread) is indicated for comparison 


calorimeter granularities, at which the LICH performance is 
analyzed. 

The performance is scanned over certain ranges of the 
following parameters: 


the number of layers in ECAL, taking the value of 20, 
26, 30; 

the number of layers in HCAL: 20, 30, 40, 48; 

the ECAL cell size = 5x5 mm?, 10x10 mm’, 20x20 
mm?, 40x40 mm? 

HCAL cell size = 10x10 mm?, 20x20 mm?, 40x40 
mm’, 60x60 mm?, 8080 mm? 


In general, the lepton identification performance is ex- 
tremely stable over the scanned parameter space. Only for 
HCAL cell size larger than 60x60 mm? or HCAL layer 


number less than 20, marginal performance degradation is 
observed: the efficiency of identifying muons degrades by 
1-2% for low energy particles (E < 2 GeV), and the iden- 
tification efficiency of pion degrades slightly over the full 
energy range, see Figure 7. 
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Fig. 7 The efficiency of lepton identification for two different geome- 
tries 


6 Performance on physics events 


The Higgs boson is mainly generated through the Higgsst- 
rahlung process (ZH) and more marginally through vector 
boson fusion processes at electron-positron Higgs factories. 
A significant part of the Higgs bosons will be generated to- 
gether with a pair of leptons (electrons and muons). These 
leptons are generated from the Z boson decay of the ZH pro- 
cess. For the electrons, they can also be generated together 
with Higgs boson in the Z boson fusions events, see Figure 
8. At the CEPC, 3.6 x 10+ uuH events and 3.9 x 10+ eeH 
events are expected at an integrated luminosity of 5 ab~!. In 
these events, the particles are rather isolated. 


Fig. 8 Feynman diagrams of major Higgs production with leptons at 
CEPC: the Higgsstrahlung and ZZ fusion processes. 


The eeH and uuH events provide an excellent access to 
the model-independent measurement to the Higgs boson us- 
ing the recoil mass method [7]. The recoil mass spectrum of 
eeH and uuH events is shown in Figure 9, which exhibits a 
high energy tail induced by the radiation effects (ISR, FSR, 
bremsstrahlung, beamstrahlung, etc), while in CEPC the be- 
amstrahlung effect is negligible. The bremsstrahlung effects 
for the muons are significantly smaller than that for the elec- 
trons, therefore, it has a higher maximum and a smaller tail. 
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Fig. 9 The recoil mass spectrum of ee/uu, low energy peak in eeH 
corresponds to the Z fusion events 


Figure 10 shows the energy spectrum for all the recon- 
structed charged particles in 10k eeH/ uH events. The lep- 
tons could be classified into 2 classes, the initial leptons 
(those generated together with the Higgs boson) and those 
generated from the Higgs boson decay cascade. For the eeH 
events, the energy spectrum of the initial electron exhibits 
a small peak at low energy, corresponding to the Z fusion 
events. The precise identification of these initial leptons is 
the key physics objective for the lepton identification per- 
formance of the detector. 
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Fig. 10 Energy Spectrum of charged particles in eeH event at 250 GeV 
center of mass energy 


Since the lepton identification performance depends on 
the particle energy, and most of the initial leptons have an 
energy higher than 20 GeV, we focused on the performance 
study of lepton identification on these high energy particles 
at detectors with two different sets of calorimeter cell sizes. 

The p-likeliness and e-likeliness of electrons, muons, 
and pions, for eeH events and uuH events are shown in 
Figure 11 and Figure 12. Table 3 summarizes the definition 
of leptons and the corresponding performance at different 
conditions. The identification efficiencies for the initial lep- 
tons is degraded by 1-2% with respect to the single parti- 
cle case. This degradation is mainly caused by the shower 
overlap, and it’s much more significant for electrons as elec- 
tron showers are much wider than that of muon, leading to a 
larger chance of overlapping. The electrons in 4uH events 
and vice versa, are generated in the Higgs decay. Their iden- 
tification efficiency and purity still remains at a reasonable 
level. For charged leptons with energy lower than 20 GeV, 
the performance degrades by about 10% because of the high 
statistics of background and the cluster overlap. The event 
identification efficiency, which is defined as the chance of 
successfully identifying both initial leptons, is presented in 


the last row of Table 3. The event identification efficiencies 
is roughly the square of the identification efficiency of the 
initial leptons. Comparing the performance of both geome- 
tries, it is shown that when the number of readout channels is 
reduced by 4, the event reconstruction efficiency is degraded 
by 1.3% and 1.7%, for uuH and eeH events respectively. 


7 Conclusion 


The high granularity calorimeter is a promising technology 
for detectors in collider facilities of the High Energy Fron- 
tiers. It provides good separation between different final state 
particles, which is essential for the PFA reconstructions. It 
also records the shower spatial development and energy pro- 
file to an unprecedented level of details, which can be used 
for the energy measurement and particle identifications. 

To exploit the capability of lepton identification with 
high granularity calorimeters and also to provide a viable 
toolkit for the future Higgs factories, LICH, a TMVA based 
lepton identification package dedicated to high granular ca- 
lorimeter, has been developed. Using mostly the shower de- 
scription variables extracted from the high granularity ca- 
lorimeter and also the dE/dx information measured from 
tracker, LICH calculates the e-likeness and p-likeness for 
each individually reconstructed charged particle. Based on 
these output likelihoods, the leptons can be identified ac- 
cording to different physics requirement. 

Applied to single particle samples simulated with the 
CEPC_v1 detector geometry, the typical identification effi- 
ciency for electron and muon is higher than 99.5% for ener- 
gies higher than 2 GeV. For pions, the efficiency is reaching 
98%. These efficiencies are comparable to the performance 
reached by ALEPH, while the mis-identification rates are 
significantly improved. Ultimately, the performances are lim- 
ited by the irreducible confusions, in the sense that the chance 
for muon to be mis-identified as electron and vice versa is 
negligible, the mis-identification of pion to muon is domi- 
nated by the pion decay. 

The tested geometry uses a ultra-high granularity calo- 
rimeter: the cell size is 1 by 1 cm? and the layer number 
of ECAL/HCAL is 30/48. In order to reduce the total chan- 
nel number, LICH is applied to a much more modest gran- 
ularity, it is found that the lepton identification performance 
degrades only at particle energies lower than 2 GeV for an 
HCAL cell size bigger than 60x60 mm? or with an HCAL 
layer number less than 20. 

The lepton identification performance of LICH is also 
tested on the most important physics events at CEPC. In 
these events, multiple final state particles could be produced 
in a single collision, the particle identification performance 
will potentially be degraded by the overlap between nearby 
particles. The lepton identification on eeH/ uH event at 250 


Table 3 uuH/eeH events lepton identification efficiency 


Geom 1 (ECAL and HCAL Cell Size 10x10 mm?) 


Geom 2 (ECAL and HCAL Cell Size 2020 mm?) 


uuH eeH uuH eeH 
u definition Ly>0.1 Ly>0.1 Ly>0.1 Ly>0.1 
e definition Le>0.01 Ly<0.1 Le>0.001 Ly<0.1 Le>0.01 Ly<0.1 Le>0.001 Ly <0.1 
Ee 93.41 + 0.92 98.64 + 0.08 91.60 + 1.02 97.89 + 0.11 
Ne 92.02 + 1.00 99.74 + 0.04 89.89 + 1.10 99.67 + 0.04 
Eu 99.54+ 0.05 95.53 + 0.76 99.19 + 0.06 86.48 + 1.26 
Nu 99.60 + 0.04 96.31 +0.70 99.83 + 0.03 95.38 +0.81 
Eevent 98.53 + 0.13 97.06 + 0.19 97.24+ 0.18 95.40 + 0.24 


Fig. 12 e-likelihood and p1-likelihood of charged particles with E>20 GeV in uuH event 


GeV collision energy has been checked. The efficiency for a 
single lepton identification is consistent with the single par- 
ticle results. The efficiency of finding two leptons decreases 
by 1~2 % when the cell size doubles, which means that the 
detector needs 2~4% more statistics in the running. In eeH 
events, the performance degrades because the clustering al- 
gorithm still needs to be optimized. 


To conclude, ultra-high granularity calorimeter designed 
for ILC provides excellent lepton identification ability, for 
operation close to ZH threshold. It may be a slight overkill 
for CEPC and a slightly reduced granularity can reach a bet- 
ter compromise. And LICH, the dedicated lepton identifica- 
tion for future e+e- Higgs factory, is prepared. 
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Appendix A: Appendix section 


List and meaning of variables used in the TMVA which are 
not mentioned in the text: 


— NH_ECALF10: Number of hits in the first 10 layers of 
ECAL 

— FD_ECALL20: FD calculated using hits in the last 20 
layers of ECAL 


FD_ECALF10: FD calculated using hits in the first 10 

layers of ECAL 

AL_ECAL: Number of ECAL layer groups (each five 

layers forms a group) with hits 

— av_NHH: Average number of hits in each HCAL layer 
groups (each five layers forms a group) 

— rms_Hcal: The RMS of hits in each HCAL layer groups 
(each five layers forms a group) 

— EEClu_r: Energy deposited in a cylinder around the in- 
cident direction with a radius of 1 Moliere radius 

— EEClu_R: Energy deposited in a cylinder around the in- 
cident direction with a radius of 1.5 Moliere radius 

— EEClu_L10: Energy deposited in the first 10 layers of 
ECAL 

— MaxDisHel: Maximum distance between a hit and the 
helix 

— minDepth: Depth of the inner most hit 

— cluDepth: Depth of the cluster position 

— graDepth: Depth of the cluster gravity center 

— EcalEn: Energy deposited in ECAL 

— avDisHtoL: Average distance between a hit to the axis 
from the inner most hit and the gravity center 

— maxDisHtoL: Maximum distance between a hit to the 
axis from the inner most hit and the gravity center 

— NLHcal: Number of HCAL layers with hits 

— NLEcal: Number of ECAL layers with hits 

— HcalNHit: Number of HCAL hits 

— EcalNHit: Number of ECAL hits 
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